GH-108866: Change optimizer API and contract #109144

markshannon · 2023-09-08T15:11:36Z

Partial implementation of #108866

Changes the return type of the execute function from _PyInterpreterFrame * to _Py_CODEUNIT *, returning the next instruction to execute
The execute function always executes at least one instruction.

Since the uop and optimizer and executor start at the top of the loop, starting at the JUMP_BACKWARD means that they always execute one or more instructions.
ENTER_EXECUTOR is more efficient as it doesn't need to worry about edge cases like executing no instructions or handling INSTRUMENTED_LINE.

Issue: We need to change the contract and interface of _PyExecutorObject and _PyOptimizerObject #108866

…execute at least one instruction.

gvanrossum

The more I think about this, the more I think that the problem with signature of the executor is that it has too many return values:

An error indicator
The frame where the error (or exit) occurred
The instruction pointer where it occurred
The stack pointer at that time

Both the frame and the instruction pointer are needed to continue execution and also for error handling, and both can differ from the value before the executor runs. Same for the stack pointer (though we always sync that through the frame object).

Things would be neater if these were all explicit and we didn't have to recover one or the other through tstate->current_frame.

The optimizer inherits this awkwardness because it promises to also run the executor. (And why is that? It seems to make things more complicated, and it means the opcode isn't replaced until after that first run -- won't that confuse other threads?)

The requirement to include the JUMP_BACKWARD instruction in the input to the optimizer causes a bunch of extra work which I'm also not keen on. And, despite what I said in in gh-108866, I actually still don't completely understand why we need this.

In any case maybe we need to split the two changes.

Python/bytecodes.c

gvanrossum · 2023-09-08T23:43:08Z

Python/bytecodes.c

+                while(oparg > 255) {
+                    oparg >>= 8;
+                    src--;
+                }


Clever. I'd add assert(src->op.code == EXTENDED_ARG).

I don't think this assert is correct, since it could be an INSTRUMENTED_LINE or something. Ditto for the matching assert in _PyOptimizer_BackEdge.

Ouch. Can instrumentation really overwrite EXTENDED_ARG? @markshannon

Python/optimizer.c

markshannon · 2023-09-11T15:49:08Z

The frame pointer, instruction pointer and stack pointer are all part of the VM state.
We either need to store them in memory, or pass them as an argument/return value when calling or returning from a C function (or jitted code).

The only reason we passed these values as arguments or return value was for efficiency. It reduces the number of memory accesses at the point of transfer. If it turns out that most callees don't need them, we can just remove the parameter from the API.

The difference with this PR is that the return value is meaningful, not just a cached value. The next_instr cannot be derived from the prev_instr stored on the frame. If the last instruction was a jump then next_instr != prev_instr + 1.

Things would be neater if these were all explicit and we didn't have to recover one or the other through tstate->current_frame.

tstate->current_frame is the current frame and holds the previous instruction and stack pointers.
Unless we are passing values in registers for performance, we don't want to make copies that could get out of sync.

gvanrossum · 2023-09-11T18:01:28Z

Thanks for the changes.

I believe we decided offline to hold off on this until Brandt has had an opportunity to change _PyOptimizer_BackEdge not to call the executor, and to modify JUMP_BACKWARD instead to "deoptimize" to ENTER_EXECUTOR. (Or to find out that there was a reason why things are the way they are.)

markshannon · 2024-01-18T17:04:50Z

This is now obsolete

markshannon added 2 commits September 7, 2023 04:47

Change optimizer API and contract to return Py_CODE_UNIT* and always …

4dbfa49

…execute at least one instruction.

Add news

ca9550b

bedevere-bot added the awaiting core review label Sep 8, 2023

markshannon requested a review from gvanrossum September 8, 2023 15:11

bedevere-bot mentioned this pull request Sep 8, 2023

We need to change the contract and interface of _PyExecutorObject and _PyOptimizerObject #108866

Closed

gvanrossum reviewed Sep 9, 2023

View reviewed changes

Address review comments

2396108

markshannon mentioned this pull request Oct 25, 2023

gh-109094: replace frame->prev_instr by frame->instr_ptr #109095

Merged

markshannon closed this Jan 18, 2024

markshannon deleted the optimizer-return-next-instr branch January 18, 2024 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-108866: Change optimizer API and contract #109144

GH-108866: Change optimizer API and contract #109144

markshannon commented Sep 8, 2023 •

edited by bedevere-bot

Loading

gvanrossum left a comment

gvanrossum Sep 8, 2023

brandtbucher Sep 12, 2023

gvanrossum Sep 12, 2023

markshannon commented Sep 11, 2023

gvanrossum commented Sep 11, 2023

markshannon commented Jan 18, 2024

GH-108866: Change optimizer API and contract #109144

GH-108866: Change optimizer API and contract #109144

Conversation

markshannon commented Sep 8, 2023 • edited by bedevere-bot Loading

gvanrossum left a comment

Choose a reason for hiding this comment

gvanrossum Sep 8, 2023

Choose a reason for hiding this comment

brandtbucher Sep 12, 2023

Choose a reason for hiding this comment

gvanrossum Sep 12, 2023

Choose a reason for hiding this comment

markshannon commented Sep 11, 2023

gvanrossum commented Sep 11, 2023

markshannon commented Jan 18, 2024

markshannon commented Sep 8, 2023 •

edited by bedevere-bot

Loading